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(57) Abstract 

The power consumed within an integrated circuit (IC) is reduced by throttling the performance of particular functional units (105) 
within the IC. The recent utilization levels of particular functional units within an IC are monitored (108), for example, by computing 
each functional unit's average duty cycle over its recent operating history (106). If this activity level (109) is greater than a threshold the 
functional unit is operated in a reduced-power mode (1 10). The threshold value is set large enough to allow short bursts of high utilization 
to occur. An IC can dynamically make the tradeoff between high-speed operation and low-power operation, by throttling back performance 
of runcaonal units when their utilization exceeds a sustainable level. This dynamic power/speed tradeoff can be optimized across multiple 
functional units within an IC or among multiple ICs within a system. This dynamic power/speed tradeoff can be altered by providing 
software control over throttling parameters. 7 F 5 
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Localized Performance Throttling 
To Reduce IC Power Consumption 

5 FIELD OF THE INVENTION 

The invention relates generally to reducing the power 
consumption of Integrated Circuits (ICs), and particularly of Very Large 
Scale Integration (VLSI) ICs. In particular, it relates to methods and 
apparatus for throttling the performance of particular functional units 
1 0 within an IC as needed to control worst-case power consumption. 

BACKGROUND OF THE INVENTION 

Reducing the power consumed by an IC has significant 
advantages: (1) Less power must be supplied to the IC; and (2) Less 

1 5 heat must be dissipated by the IC and the devices surrounding it. 
Reducing power consumption is especially important when an IC is 
going to be used in a portable computing device, such as a hand-held 
or notebook-size digital device. 

Portable devices often operate for extended periods of time using 

20 only the power supplied by an internal battery. Because the size, weight 
and storage capacity of a portable battery is very limited, conserving 
power is critical in portable devices. The less power its ICs consume, 
the longer time the portable device can operate without changing or 
recharging its batteries. 

25 Further, portable devices generally must dissipate the heat that 

their components generate without the assistance of the mechanical 
heat sinks or radiators and cooling fans that can easily be used in a 
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desk-top or rack-mount computer system. When the ICs within a 
portable device consume less power, it operates at a lower temperature. 
Elevated temperatures within a computing device can make its 
components operate unreliably or have shorter lifetimes. 
5 The power consumed by an IC can be reduced by lowering the 

speed at which it operates. For an IC fabricated using CMOS 
technology, which dominates the manufacture of commercial ICs, the 
power the IC consumes is directly proportional to both its clock rate and 
its operating voltage. If either clock rate or voltage is reduced, then the 

10 power consumed is reduced. Reducing the voltage also requires 

lowering the clock rate, unless an offsetting improvement is made in the 
manufacturing technology. 

Because typically a fixed number of clock cycles is required to 
perform a particular operation, approaches to reducing IC power 

1 5 consumption that reduce the clock rate of the IC unfortunately also 
reduce performance. Thus, there is a need to reduce the power 
consumed by an IC without reducing its performance. 

For many complex ICs, the power consumed varies widely with 
the task that they are performing. If more of the circuit nodes within the 

20 IC transition from one to zero or visa versa, then more power is 

consumed. Thus in order to specify the typical power consumption of a 
particular IC, it is necessary to define a benchmark sequence of 
operations that constitutes its typical usage. Such a benchmark would 
likely include substantial amounts of idle time, because computing 

25 devices designed for interactive use spend a large percentage of time 
waiting for user input. Once such a benchmark suite of typical 
operations is defined, then the power consumed by an IC in performing 
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those operations can be measured or estimated. Such a typical power 
consumption value would be useful, for example, in estimating the 
battery life of a portable computing device under normal use. 

It is desirable to reduce the power consumed by an IC by 
5 reducing or eliminating node transitions in functional units within the IC 
that are not being used during a particular sequence of operations. If an 
IC shuts down functional units when they are not being used, then 
typical power consumption can be significantly reduced with little or no 
impact on performance. 

1 0 However, shutting down functional units is likely to have little 

impact on worst-case power consumption, which often arises when the 
IC is performing sequences of operations that utilize many of the 
functional units within the IC. Worst-case power consumption is likely to 
be substantially higher than typical power consumption. 

1 5 Qften particular functional units or logic blocks within an IC can 

be identified that tend to consume a disproportionate share of the IC's 
power « for example, the circuitry in a microprocessor that performs 
floating-point arithmetic. The power consumed by a microprocessor is 
significantly less if it is not called on to perform many floating-point 

20 operations. 

The worst-case power consumption of a microprocessor might 
involve a sequence of floating point operations that operates on data 
values chosen to maximize node transitions from one to zero and visa 
versa, and that executes repeatedly using cache memory within the 

25 microprocessor so as to avoid reading or writing main memory. 

Additionally, if the microprocessor performs speculative evaluations of 
upcoming operations based on predicting which way a branch 
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operation will go, power consumption would be increased by increasing 
the percentage of branch operations for which the microprocessor's 
prediction is accurate. This is because an inaccurate prediction flushes 
the instruction-execution pipeline, thus leaving some functional units 
5 idle as the pipeline refills. 

The designer of the system in which the IC is to be used must 
know what the maximum power consumed by the IC will be for any 
possible sequence of operations. In order to make a system that 
incorporates an IC robust, the IC's maximum worst-case power must be 

1 0 known and specified. Reducing the worst-case power consumption of 
an IC is very important for reliability purposes, for heat dissipation 
purposes and for power-supply capacity purposes. Thus, there is a 
need to reduce the worst-case power consumed by an IC with little or no 
reduction in performance. 

15 A worst-case sequence of operations, as described above, is 

important for estimating worst-case power consumption, which is 
essential for the above-mentioned purposes. But such a sequence can 
be considered artificial, i.e. it may not be encountered in practical 
applications of a microprocessor. For example, it is artificial to use a 

20 worst case power sequence based on lots of floating point computations 
in rating a microprocessor to be used in a portable computing device 
where floating point operations are infrequently used. It may not be 
important in typical applications of portable computing devices that long 
sequences of floating-point arithmetic be performed at maximum speed. 

25 If the performance of typical operations is maintained, then it may 

be acceptable to throttle back the performance of less typical or artificial 
sequences of operations for the sake of reducing power. Thus, there is 
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a need to reduce the worst-case power consumed by an IC without 
reducing performance for normal applications. 

Summary of the Invention 

5 A novel method and apparatus for controlling power consumption 

within an IC reduces worst-case power consumption without 
substantially lowering performance for typical applications. Worst-case 
power consumption is reduced by throttling down the activity levels of 
long-duration sequences of high-power operations. 

10 Within any IC, a number of particular functional units can 

consume inordinate amounts of power. For example, floating-point 
= arithmetic units and cache memories are two types of functional units 
within a microprocessor IC that can consume substantial amounts of 
power. The invention allows IC designers to identify any number of 

1 5 such high-power functional units within the IC they are designing, and 
place each under the control of its own power controller. Further, the 
invention allows IC designers to place the IC they are designing as a 
whole under the control of an overall power controller. In the case of a 
microprocessor IC, the power consumption as a whole can effectively be 

20 throttled by lowering either the instruction retirement rate or the 
instruction issue rate. 

In one embodiment, the power controller comprises an activity 
monitor and a mode controller. The activity monitor tracks the recent 
utilization level of a particular functional unit within the IC - for example, 

25 by computing its average duty cycle over its recent operating history. If 
this activity level is greater than a threshold, then the mode controller 
switches the functional unit to operate in a reduced-power mode. The 
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threshold value is set large enough to allow short bursts of high 
utilization to occur without impacting performance. 

Embodiments of the invention exist that add only minimal cost 
and complexity to the IC's design -- for example, one up-down counter 
5 and some control circuitry per each functional unit being controlled. On 
the other hand, the invention is flexible in that it encompasses a wide 
variety of techniques for monitoring utilization of different functional 
units, for reducing the power they consume and for setting their throttling 
parameters. 

0 In accordance with another aspect of the invention, the dynamic 

power/speed tradeoff of the invention can be optimized across multiple 
functional units within an IC or among or among multiple ICs within a 
system. The invention includes optimization schemes wherein the 
maximum power consumed by a particular functional unit can be 

5 increased or decreased depending on the power being consumed 
elsewhere within the same IC, or on other ICs within the same system. 

In accordance with another aspect of the invention, the dynamic 
power/speed tradeoff of the invention can be controlled by software, 
such as platform software executing at system boot time, or operating 

0 system software, or possibly even applications software. 

Brief Descript ion of the Drawings 

The invention is illustrated in the following drawings, in which 
known circuits are shown in block-diagram form for clarity. These 
5 drawings and the following textual description are for explanation and 
for aiding the reader's understanding, but the invention should not be 
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taken as being limited to the preferred embodiments and design 
alternatives illustrated therein. 

Figure 1(a) shows the blocks of logic circuitry of the invention. 
Figure 1(b) is a state diagram showing the transitions of a 
5 functional unit from its normal mode or state to its reduced-power mode 
and back again, according to the invention. 

Figure 2 shows the blocks of logic circuitry in an embodiment of 
the invention that enforces a 50% maximum sustainable duty cycle on a 
floating point functional unit. 
1 0 Figure 3 shows the blocks of logic circuitry in an embodiment of 

s the invention that enforces a programmable maximum sustainable duty 
L? cycle on a cache memory. 

Figure 4 shows the blocks of logic circuitry in an embodiment of 
the invention that disables instruction cache prefetching based on the 
1 5 recent utilization level of the data cache. 

Figure 5 shows the blocks of logic circuitry in an embodiment of 
the invention where a power coordinator reads the activity levels of 
various functional units within an IC and alters, based on those activity 
levels, the throttling parameters of other functional units to dynamically 
20 optimize the power/speed tradeoff. 

Detailed Description of the Invention 

Overview 

The invention allows an IC to dynamically make the tradeoff 
25 between high-speed operation and low-power operation, by throttling 
back performance of a functional unit when its recent utilization exceeds 
a sustainable level. Thus, the invention allows the IC to dynamically 

OC1D <WO 99171S6A1 I > 
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throttle back the execution rate of maximum worst-case power 
consumption sequences of operations so as to not exceed the worst- 
case power consumption allowable, thus avoiding reliability, heat 
dissipation or power supply problems. 
5 At the same time, the invention minimizes any performance 

impact that such throttling has on realistic sequences of operations. 
This power reduction is done in a way that does not have a substantial 
affect on the performance of the IC for typical tasks. The localized 
control and the threshold value that the invention provides minimize 
10 performance impacts. Further, the performance impact is predictable 
and repeatable for those sequences of operations that the invention 
does throttle. 

The purpose of an IC is not to run some artificial, non-realistic 
maximum worst-case power consumption sequence of operations at 

15 high performance. Rather, it is to run realistic or typical sequences of 
operations at high performance. In some cases, there can be a 
substantial difference in power consumption between such the typical 
worse case power consumption and the artificial worst-case power 
consumption. The effectiveness of adding the invention to a particular 

20 IC design depends on the amount of difference between that design's 

artificial worst-case power consumption and its typical worst-case power 
consumption. 

A preferred way to look at typical worst-case power consumption 
is to look at realistic sequences of operations typically used to perform 
25 actual work and identify from among those sequences the particular 

sequence that maximizes power consumption. Such a sequence could 
be determined by profiling the power consumption of sequences of 
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operations in a mix of popular software programs, and choosing from 
among those sequences the sequence with the highest power 
consumption. 

According to the present invention, artificial sequences of 
5 operations that keep high-power functional units active for longer than a 
threshold are performed in low power mode. Thus, the invention 
prevents the IC from consuming power in excess of its specified 
maximum regardless of the sequence of operations it is performing. 
This is critical in the case of malicious software, such as a virus, that 
1 0 might deliberately attempt to damage a microprocessor IC or the system 
that includes the microprocessor by causing excess power 
consumption. 

The invention is independent of the technique of reducing overall 
power consumption by reducing voltage and/or clock rate. It can be 

15 used in conjunction with that approach, or in lieu of that approach. For 
example, if an IC would operate at 100 MHz except for excessive worst- 
case power consumption at that speed, then (i) the clock rate could be 
lowered to reduce worst-case power consumption; (ii) the invention 
could be employed to reduce worst-case power consumption; or (iii) a 

20 combination of both techniques could be employed. For some power- 
limited designs, using the invention could make the difference in 
whether or not a particular target clock rate can be met. 

Figure 1(a) is a block diagram of one embodiment of the 
invention. Functional unit 105 provides current activity information 108 

25 to activity monitor 106. Current activity information 108 describes what 
tasks or operations functional unit 105 is currently performing, or 
indicates that it is currently idle. Based on this current activity 
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information 108, activity monitor 106 generates activity level 109, and 
provides it to mode controller 107. Activity level 109 could be a number, 
a set of signals each indicating that the activity level is within a specified 
range, or even a single bit. Based on activity level 109, mode controller 
5 107 generates mode control signal 1 10, which is coupled to functional 
unit 105. 

Mode controller 107 switches functional unit 105 between a 
normal mode of operation 101 (typically one with high performance and 
high power consumption), and a reduced-power mode 102 (typically 

10 one lower in performance and lower in power consumption). 

Activity monitor 106 monitors the recent utilization of functional 
unit 105, via activity level 109. Activity level 109 could be a special 
signal generated by functional unit 105, or it could simply be the 
commands that functional unit 105 receives and responds to. 

1 5 Monitoring the recent utilization could consist of, for example, computing 
the average duty cycle of the functional unit over the preceding 
thousand cycles. If this activity level exceeds a threshold, then mode 
controller 107 places functional unit 105 in reduced-power mode. 
Further, if it is desired to monitor the overall power consumption of an IC, 

20 then its substrate temperature could be measured and this value used 
as the activity level of the invention. 

Figure 1(b) is a state-transition diagram of the operation of the 
invention. It shows how mode controller 107 causes functional unit 105 
to transition between normal mode 101 and reduced-power mode 102. 

25 When the functional unit is in normal mode 101 and the recent utilization 
is greater than the threshold, then transition 103 occurs in which the 
mode controller places the functional unit in reduced-power mode 102. 



WO 99/17186 ^ PCT/US97/17492 



Similarly, when in reduced-power mode 102 and the recent utilization is 
less than the threshold, then the mode controller takes transition 104 to 
restore the functional unit it controls to normal mode 101. 

Preferably, the threshold value used is set based on profiling the 
5 realistic worst-case power consumption benchmark being used in the 
design of this particular IC. The threshold is preferably set large enough 
that all or most bursts of high activity occurring in this benchmark are 
shorter than this threshold, and thus can be speedily executed with little 
or no throttling. 

10 In the case whelre heat dissipation is the primary determinant of 

: how much power can be consumed, the threshold may be on the order 

of a hundred thousand (100,000) operations. A spike in power 
~ consumption of one millisecond (1 ms) may well be tolerable from a 
thermal point of view. If the IC is clocked at 100 MHz, then a 1 ms spike 
1 5 is 100,000 clock cycles. A substantial amount of high-speed 

computation can be performed in a high-power burst of 100,000 clock 
cycles. Thus, the invention allows bursts of high activity, unless their 
duration exceeds the threshold. 

20 Design Alternatives for Monitoring Utilization 

The invention is flexible in that it encompasses a wide range of 
methods and devices for monitoring activity levels. These design 
alternatives range from very simple to quite complex. In fact, each 
functional unit controlled may have a different monitoring technique to 
25 which it is best suited. 

A particularly simple monitoring technique is to use an up/down 
counter as an activity-level register whose contents indicate the current 

XrtD" <WO 9£>l7ie6A1 I > 
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utilization of the functional unit being monitored. In a simple 
implementation, the up/down counter increments its contents by one 
during each clock cycle that the functional unit is active and decrements 
its contents by one for each clock cycle the functional unit is inactive. A 
5 slightly more complex design alternative is to increment and decrement 
not for each clock cycle, but rather once per each complex operation 
that the functional unit performs and decrement for each corresponding 
period that the functional unit is inactive. Another design alternative is 
for the activity monitor to increment by a value other then one, to 

1 0 decrement by a value other than one, or both. 

If the value by which the contents of the activity-level register is 
increased during each active cycle equals the value by which the 
activity-level register is decreased during each inactive cycle, then the 
activity monitor functions to enforce a maximum sustainable duty cycle 

1 5 of fifty percent (50%). In an up-down counter implementation, care must 
be taken that the contents of the activity-level register never go below 
zero, or alternatively that a negative number as the value in the activity- 
level register is distinguished from a roll-over condition in which the 
value becomes too large in the positive direction. 

20 The current value of the activity-level register is compared against 

a threshold value. The threshold value is independent of the maximum 
sustainable duty cycle. It is set so as to be large enough so that short 
bursts of high activity can execute at full speed. Preferably, the 
threshold value is set by profiling the sequence of operations selected 

25 as the realistic worst-case power consumption benchmark. The 

threshold value can be thought of as a deficit limit which the functional 
unit can not exceed without having its speed throttled down. Carrying 
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this analogy further, the current value of the activity-level register can be 
thought of as its current power deficit. 

If a maximum sustainable duty cycle value other than fifty percent 
(50%) is desired, then it is necessary to have the active increment be 
unequal in magnitude to the inactive decrement. For example, an 
increment of two and a decrement of one produce a thirty-three (33%) 
percent maximum sustainable duty cycle. The sustainable duty cycle is 
given by Equation 1: 

ID 

Equation 1: SDC = — 

(ID+AI) 

SDC represents the sustainable duty cycle, Al represents the active 
increment amount and ID represents the inactive decrement amount. In 
Equation 1, Al and ID are each positive and represent the absolute 
value of the increment and decrement values actually used. Preferably 
the active increment value is positive and the inactive decrement value 
is negative. 

If the active increment, Al, value is chosen to be one, then the 
maximum number of consecutive cycles that the functional unit can be 
active is equal to the threshold value. In general, the maximum burst 
length is given by Equation 2: 

TH 

Equation 2: MBL = — 

Al 

MBL represents the number of functional-unit cycles in the maximum 
burst length and TH represents the threshold value used to compare 
with the current activity value. 

More sophisticated activity monitoring schemes are possible 
within the scope of the invention. For example, the type of operation the 
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functional unit is asked to perform could be monitored by an activity 
monitor that associated a particular activity increment with each possible 
type of operation. In such a scheme, the contents of the activity-level 
register could simply decrement at a constant rate. 

5 

Design Alternatives for Reducing Power 

The invention is flexible in that it encompasses a wide range of 
design alternatives for reducing the power of the functional unit that it 
controls. These design alternatives can range from very simple to quite 

1 0 complex. In fact, each functional unit controlled may have a different 
power reduction technique for which it is most suited. 

A simple way to reduce the power consumed by the functional 
unit is to reduce its clock rate. This could be performed by dividing the 
clock which it normally receives by two, or by suppressing every other 

1 5 clock pulse. In the case where the maximum sustainable duty cycle is 
fifty percent, then dividing the clock provided to the functional unit by two 
when the threshold is exceeded enforces this maximum duty cycle. 
Alternatively, the clock rate could be reduced by a factor other than two. 
Many ICs include cache memory to keep an internal, and thus 

20 quickly assessable, copy of data that is available at a slower speed in 
some type of external memory. Cache memory is used for performance 
reasons. Much less delay is involved in accessing the information from 
an on-chip cache than in accessing it from a device external to the IC. 
Cache memories can be major consumers of power within an IC. 

25 Thus, it may be desirable to place on-chip cache memories under the 
control of the invention. A simple scheme for reducing the power 
consumption of the cache memory is to force access to the external 
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memory (even if a copy of the data is present in the on-chip cache), 
when necessary, to reduce power consumption because the cache 
memory's maximum sustainable duty cycle has been exceeded. 

It will be clear to one skilled in the art that an IC may have other 
5 on-chip functional units whose functions can be performed at lower 
speed by off-chip circuits. These are candidates for the same power 
reduction technique as used for cache memory - that is, have the off- 
chip circuit perform the operation when needed to reduce on-chip power 
consumption. 

1 0 In rare cases it may be cost effective to include on an IC two 

v complete implementations of a particular functional unit — one being 
2 high speed and high power and the other being low speed and low 
power. In this case, the mode controller of the invention selects which is 
to be used based on the current utilization of the functional unit and the 
1 5 current value of its threshold parameter. 

In the case of a microprocessor that performs speculative 
instruction execution, instructions are started through the instruction- 
evaluation pipeline anticipating that a conditional branch instruction will 
(or will not) be taken. If the prediction as to whether or not the branch is 
20 taken is correct, then a significant performance speed-up is achieved. 
But sometimes the branch prediction is wrong and as soon as this is 
known, then the results of the speculative evaluations are discarded and 
the correct instructions are started through the instruction-evaluation 
pipeline. A preferred reduced-power mode for a microprocessor that 
25 performs speculative instruction execution may be to reduce or 
eliminate speculative instruction elimination. 
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Another example of speculative operation is cache prefetching. 
Many ICs with on-chip cache memories anticipate that instruction or 
data memory accesses will be sequential. To increase performance, 
they prefetch to the instruction or data cache some number of words 
5 adjacent to the currently requested instruction or data address. A 
preferred reduced-power mode for a cache memory may comprise 
disabling some or all of its speculative prefetches. In general, a 
preferred reduced-power mode for any functional unit may be to reduce 
or eliminate its speculative activities. 

10 

Controlling a Floating-Point Arithmetic Unit bv an Up/Down 
Counter 

Figure 2 shows an embodiment of the invention that enforces a 
maximum sustainable duty cycle of fifty percent (50%) on floating-point 

1 5 unit 206. In its normal operating mode, multiplexor 203 passes system 
clock 201 on to the clock input of floating-point unit 206. In its reduced- 
power-mode, multiplexor 204 passes the output of divide-by-two circuit 
202 on to the clock input of floating-point unit 206, thus cutting both its 
speed and power consumption in half. 

20 Floating-point unit 206 provides active signal 207 to the up/down 

control input of up/down counter 205. For each cycle of system clock 
201 for which active signal 207 is true, up/down counter 205 increments 
its contents by one. For each cycle of system clock 201 for which active 
signal 207 is false, up/down counter 205 decrements its contents by 

25 one. If a decrement would take its contents below zero, then up/down 
counter 205 stays at zero. 
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The select input of multiplexor 203 is driven by most-significant- 
bit output 204 from up/down counter 205. This bit provides the feedback 
to control whether or not the invention places floating point unit 206 into 
reduced power mode. Thus in this embodiment, the threshold is 
5 predetermined and must be a power of two. Which power of two is used 
is selected by the number of bits in up/down counter 205. When the 
contents of up/down counter 205 is large enough that its most significant 
bit is a one, then the reduced-power mode is entered and multiplexor 
203 selects the output of divide by two circuit 202 to clock floating-point 
1 0 unit 206. 

Clocking floating-point unit 206 at half the frequency has the 
effect of enforcing a fifty percent (50%) maximum duty cycle on floating- 
point unit 206 during the period that it is in reduced-power mode i.e., the 
period that most-significant-bit 204 is a one. During this period, floating- 
1 5 point unit active signal 207 is true for every other cycle of system clock 
201. 

The increment magnitude and decrement magnitude used in this 
embodiment of the invention are equal; that is the contents of up/down 
counter 205 are either increased or decreased by one. Therefore, the 

20 maximum sustainable duty cycle allowed for floating-point unit 206 is 
fifty percent (50%). Therefore, if the sequence of operations being 
performed by the IC attempts to sustain a floating-point duty cycle of 
more than fifty percent (50%) for longer than the burst allowed by the 
predetermined threshold, then the floating point unit's performance is 

25 throttled down to stay within a duty cycle of fifty percent. 
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Controlling a Cache by a Programmable Activity Monitor 

Figure 3 shows an embodiment of the invention in which the 
power consumption of a cache memory is controlled in a programmable 
manner. The activity monitor in this embodiment of the invention 

5 functions as follows: Cache active signal 316 is provided from the on- 
chip cache to the select input of multiplexor 307. Based on the current 
state of cache active signal 316, either the value in active increment 
register 304 or the value in inactive decrement register 305 is presented 
as a first operand input to adder 308. A second operand input to adder 

0 308 is provided from the current value of activity-level register 309. 
Adder 308 sums the values of these two operand inputs and provides 
the result as the new current value to be stored in activity-level register 
309. 

The mode controller in this embodiment of the invention functions 
5 as follows: The control signal cache unavailable 312 is the result of 
comparator 310 determining that the contents of activity-level register 
309 are larger than the contents of threshold register 306. When it 
asserts control signal cache unavailable 312, the cache memory is put 
into its reduced-power mode, i.e. accesses to it are denied. This 
0 preferably forces the processor into an idle or wait state if it attempts to 
reference the cache when cache unavailable 312 is asserted. 

The values in threshold register 306, inactive decrement register 
305, and active increment register 304 can be programmable by a 
variety of mechanisms not shown in Figure 3. These values are the 
5 throttling parameters associated with the functional unit being 

controlled. The values in active increment register 304 and inactive 
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decrement register 305 must be of opposite signs — one must be 
negative and one positive. 

As explained above in connection with Equation 1, the values 
of active increment register 304 and inactive decrement register 305 can 
5 be selected to enforce a wide range of maximum duty cycles on the on- 
chip cache. Further, the value of threshold register 306 can be 
programmed to vary the maximum duration of bursts of high cache 
activity. This enables high performance on sequence of operations that 
require bursts of cache accesses — at least for those bursts of duration 
1 0 within tolerable power-consumption limits. 

Design Alternatives for Proarammability 

The invention is flexible in that it encompasses a wide range of 
design alternatives for programming or setting the contents of the 

1 5 throttling parameters associated with a particular functional unit, i.e. of 
threshold register 306, inactive decrement register 305, and active 
increment register 304. They could be read-only values programmed to 
the desired value like a read-only memory (ROM) by varying one or two 
of the mask layers used to fabricate the IC. They could be 

20 programmable read-only values programmed like a programmable 

read-only memory (PROM) by a one-time-only writing process such as 
blowing a fusible link for each bit. These design alternatives allow 
different versions of the IC with different power consumption and 
performance specifications. 

25 Alternatively, programming the throttling-parameter values could 

be under software control - either under control of the platform software 
*- or basic input/output system (BIOS) at system boot or power-on self test 
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(POST) time, or dynamically under control of the operating system, or 
perhaps under limited dynamic control by the applications software. If 
programmed at system boot time, the choice of values could reflect the 
power supply and heat dissipation characteristics of the system in which 
5 the IC is used. For example, substantially different values could be used 
for a portable device versus a desk-top device. If programmed by 
applications software, the choice of values could reflect the software 
setting itself a power-consumption budget rich in floating-point 
operations but meager in cache accesses, or visa versa. Preferably the 
0 hardware would enforce constraints on any throttling parameter values 
set by software so as to maintain overall power consumption within 
specifications - for example, an increased value for one parameter 
might force an automatic decrease in another. 

Alternatively, the values could be altered dynamically as the IC 
5 operates as part of coordinating power consumption across multiple 
functional units as discussed below. In this context, the current value of 
activity-level register 309 can be considered one of the throttling 
parameters that can be decreased to give its associated functional unit a 
one-time performance boost or increased to give it a temporary 
0 restriction in power consumption. 

Coordinating Power Co nsumption Across Multiple Functional 
Units 

Figure 4 is a block diagram of an embodiment of the invention 
5 for a microprocessor or other IC that includes both a data cache and an 
instruction cache, that performs speculative prefetches into the 
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instruction cache, and that disables speculative instruction-cache 
prefetching based on the recent utilization of the data cache. 

An activity-level monitor and mode controller enforce a maximum 
sustainable duty cycle on accesses to a data cache (not shown). Biocks 
and signals 301 to 316 function as do their correspondingly numbered 
counterpart in Figure 3. Thus, when the maximum sustainable duty 
cycle for data cache accesses is exceeded for more than the threshold 
duration, then the reduced-power mode is entered and the processor 
may be forced to be idle for a cycle or more if it attempts to reference the 
data cache. 

Additionally, comparator 402 compares the current value in 
activity-level register 309 with the value in prefetch threshold register 
401 . Based on the results of this comparison, the control signal throttle 
instruction-cache prefetch 403 is generated if the activity level exceeds 
this threshold. This control signal restricts speculative prefetches into 
the instruction cache (not shown), i.e. it disables all prefetches, or at 
least reduces their rate. 

The premise of this embodiment is that as long as the activity 
level in the data cache is below a threshold, then speculative prefetches 
into the instruction cache should be performed. They may speed up 
execution and they are currently affordable in terms of power 
consumption. But if the data cache activity level exceeds this threshold, 
then speculative prefetches into instruction cache should not be 
performed. The power they consume is not currently affordable. 
Further, the performance speed-up given up may be marginal because 
speculative prefetches have no benefit when the instruction accesses in 
a particular sequence of operations happen to be non-sequential. 
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Figure 5 is a block diagram of an embodiment of the invention 
that includes a power coordinator to dynamically optimize the 
power/speed tradeoff allowed by the invention across multiple functional 
units. In general, power coordinator 503 can read the activity levels 
5 associated with one or more functional units 501 within integrated circuit 
#1 and alter their throttling parameters, i.e. their associated active 
increment, inactive decrement, threshold or activity-level values. 

Integrated circuit #1 may include a number of additional 
functional units, such as Functional Unit #4, whose power consumption 

10 is not monitored — perhaps because it is constant ,or relatively small, or 
it is the preferred functional unit to throttle. A high-power functional unit 
with minimal performance impact might be a good candidate for being 
the first functional unit to throttle. 

Integrated circuit #1 may include a number of additional 

1 5 functional units, like Functional Unit #3, whose power consumption is 
not controlled — perhaps because the power consumed is relatively 
small, or there is no cost-effective technique for controlling its power 
consumption, or there is a very substantial performance impact of 
reducing its power consumption that makes it an unlikely candidate for 

20 such control. 

Based on current activity levels of any or all functional units being 
monitored, power coordinator 503 can alter the active increments and/or 
inactive decrements associated with a particular functional unit 501, 
thus changing that functional unit's maximum sustainable duty cycle. 

25 Similarly, power coordinator 503 can alter the threshold 

associated with a particular functional unit 501. This might be done to 
allow longer bursts of high activity in that particular functional unit. 
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Alternatively, this might be done in conjunction with changing that unit's 
active increment value in order to keep the maximum high-power burst 
length constant, as discussed in connection with Equation 2 above. 

Similarly, power coordinator 503 can alter the current activity 
level associated with a particular functional unit 501, thus giving that 
functional unit a one-time performance boost or power restriction. 

As shown in IC #1 in Figure 5, functional units #1 and #2 have 
an associated activity monitor & mode controller 502. Functional unit #3 
has an associated activity monitor 504, but no mode controller. 
Functional unit #4 has an associated mode controller 505, but no activity 
monitor. Activity monitor & mode controller 502, activity monitor 504 and 
mode controller 505 are different types of local power controllers, whose 
operation is coordinated by power coordinator 503 altering their 
throttling parameters. 

For example, functional unit #1 could be an instruction cache and 
functional unit #2 could be a data cache, each with an associated 
activity monitor and power controller, similar to that shown in Figure 3. 
Functional unit #3 could be a floating-point arithmetic unit. Functional 
unit #4 could be the unit that performs instruction cache prefetching, as 
discussed in conjunction with Figure 4. 

In this example, power coordinator 503 could throttle or disable 
instruction cache prefetching based on whether or not the total of the 
current activity-level values within each activity monitor & mode 
controller 502 and each activity monitor 504 exceeds a threshold. The 
premise here is that speculative instruction fetching is the first activity to 
be throttled down, because the performance penalty paid by doing that 
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is smaller than that of the reduced-power mode of the other functional 
units. 

Additionally in this example, power coordinator 503 could raise or 
lower the maximum sustainable duty cycle for each of instruction cache 
5 functional unit #1 and data cache functional unit #2 based on whether or 
not the current activity level associated with floating-point functional unit 
#3 exceeds a threshold. The premise here is that floating-point 
functional unit #3 is never throttled because the performance penalty 
paid by doing that is larger than that paid by backing off on the 

1 0 maximum duty cycles of the caches. In practice this premise is true for 
some architectures performing some types of applications, and for other 
architectures or types of applications it would be the other way around. 
The monitoring and control schemes of the present invention are flexible 
enough to accommodate a wide range of such variations. 

1 5 Additionally in this example, power coordinator 503 could raise or 

lower the maximum sustainable duty cycle for the instruction cache 
functional unit #1 and the data cache functional unit #2 based on 
whether or not the current activity level associated with the other cache 
exceeds a threshold. The premise here is that if one is relatively 

20 inactive, then a higher sustained duty cycle can be afforded in the other. 
The validity of these or any other premises about the best 
techniques for optimizing performance in the context of worst-case 
power conservation is preferably determined by profiling the realistic 
worst-case power benchmark described above. Based on such 

25 profiling, estimates and performance tradeoffs can be made between 
the normal mode and the reduced-power mode of each functional unit 
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being controlled by the invention. Each such premise can be validated 
by simulating it against the realistic worst-case power benchmark. 

Also shown in Figure 5 is system power coordinator 506 and 
integrated circuit #2, which show how the dynamic power/speed tradeoff 
5 of the invention can be hierarchically extended to the level of multiple-IC 
systems. Each integrated circuit 500 provides power consumption 
information 507 to system power coordinator 506. System power 
coordinator 506 provides power consumption commands 507 to each 
integrated circuit 500. The possible interactions between system pcwer 
1 0 coordinator 506 and each power coordinator 503 within each IC 500 are 
T analogous to the interactions discussed above between power 
jr coordinator 503 and the activity monitors/mode controllers 502, 504 and 
505. 

For example, a personal computer system could comprise a 
1 5 microprocessor IC and one or more peripheral controller ICs — a 

display controller IC, a modem communications IC, a disk controller IC, 
etc. The system power coordinator could raise or lower the power that 
the microprocessor can currently consume based on the recent 
utilization of the peripheral controller ICs. 
20 The preferred embodiment of the invention and various 

alternative embodiments and designs are disclosed herein. 
Nevertheless, various changes in form and detail may be made while 
practicing the invention without departing from its spirit and scope or 
from the following claims. 
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Claims 

We claim: 



1 1 . A microprocessor with controlled power consumption, 

2 comprising: 

3 a floating-point unit operable to compute floating point 

4 arithmetic in a normal mode and in a reduced-power mode; 

5 an activity monitor, coupled to said floating-point unit, 

6 operable to monitor the recent utilization of said floating-point 

7 unit; and 

8 a mode controller, coupled to said floating-point unit and to 

9 said activity monitor, operable to place said floating-point unit in 

1 0 said reduced-power mode when said recent utilization is greater 

1 1 than a threshold. 

1 2. The microprocessor of claim 1 , further comprising: 

2 a unit, coupled to said floating-point unit and to said mode 

3 controller, to reduce the rate at which said floating-point unit is 

4 clocked when said floating-point unit is in said reduced-power 

5 mode. 

1 3. A microprocessor with controlled power consumption, 

2 comprising: 

3 a cache memory operable to store information also stored 

4 in an external memory and operable in a normal mode and in a 

5 reduced-power mode; 
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6 an activity monitor, coupled to said cache memory, 

7 operable to produce an activity level indicative of the recent 

8 utilization of said cache memory; and 

9 a mode controller, coupled to said cache memory and to 

0 said activity monitor, operable to place said cache memory in 

1 said reduced-power mode when said activity level is greater than 

2 a threshold. 

1 4. The microprocessor of claim 3, further comprising: 

2 a unit, coupled to said cache memory and to said mode 

3 controller, operable to access said external memory when said 
4= cache memory is in said reduced-power mode. 

1 5. The microprocessor of claim 3, further comprising: 

2 a unit, coupled to said cache memory and to said mode 

3 controller, operable to throttle prefetches of said information into 

4 said cache memory when said cache memory is in said reduced- 

5 power mode. 

6. The microprocessor of claim 3 wherein said stored information 



comprises instructions, and said microprocessor further comprises: 
an instruction-execution unit, coupled to said cache 
memory, operable to execute said instructions. 



7. The microprocessor of claim 3 wherein said stored information 
comprises data, and said microprocessor further comprises: 



WO 99/17186 



28 



PCT/US97/17492 



3 a data-computation unit, coupled to said cache memory, 

4 operable to perform computations on said data. 

1 8. A microprocessor with controlled power consumption, 

2 comprising: 

3 a data-computation unit, operable to perform computations 

4 on data stored in an external memory; 

5 a data cache, coupled to said data-computation unit and to 

6 said external memory, operable to store said data; 

7 an activity monitor, coupled to said data cache, operable to 

8 indicate the recent utilization of said data cache; 

9 an instruction-execution unit, operable to execute 

1 0 instructions from said external memory; 

11 an instruction cache, coupled to said instruction-execution 

1 2 unit and to said external memory, operable to store said 

1 3 instructions and operable in a normal mode and in a reduced- 

14 power mode; and 

15 a mode controller, coupled to said instruction cache and to 

1 6 said activity monitor, operable to place said instruction cache in 

1 7 said reduced-power mode when said recent utilization is greater 

1 8 than a threshold. 

19. A microprocessor with controlled power consumption, 

2 comprising: 

3 an instruction-execution unit, operable to speculatively 

4 execute instructions; 



WO 99/17186 



29 



PCT/US97/17492 



5 an activity monitor, coupled to said instruction-execution 

6 unit, operable to indicate the recent utilization of said instruction- 

7 execution unit; and 

8 a mode controller, coupled to said instruction-execution 

9 unit and to said activity monitor, operable to throttle said 

1 0 speculative instruction execution when said recent utilization is 

1 1 greater than a threshold. 

1 10. A microprocessor with controlled power consumption, 

2 comprising: 

3 an instruction-execution unit, operable to execute 

4 instructions; 

5 an activity monitor, coupled to said instruction-execution 

6 unit, operable to indicate the recent utilization of said instruction- 

7 execution unit; and 

8 a mode controller, coupled to said instruction-execution 

9 unit and to said activity monitor, operable to throttle said 

1 0 instruction execution unit when said recent utilization is greater 

1 1 than a threshold. 



The microprocessor of claim 10 wherein 

said mode controller is operable to throttle said instruction 
execution unit by reducing the rate at which said instruction 
execution unit is throttled. 



1 11. 

2 

3 

4 



1 12. The microprocessor of claim 10 wherein 
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2 said mode controller is operable to throttle said instruction 

3 execution unit by reducing the rate at which said instruction 

4 execution unit retires instructions. 

1 13. The microprocessor of claim 10 wherein 

2 said mode controller is operable to throttle said instruction 

3 execution unit by reducing the rate at which said instruction 

4 execution unit issues instructions. 

1 14. The microprocessor of claim 10 wherein 

2 said activity monitor is operable to indicate the recent 

3 utilization of said instruction-execution unit based on its recent 

4 instruction retirement rate. 

1 15. The microprocessor of claim 10 wherein 

2 said activity monitor is operable to indicate the recent 

3 utilization of said instruction-execution unit based on its recent 

4 instruction issue rate. 

1 16. The microprocessor of claim 10 wherein 

2 said instruction execution unit is operable to predict the 

3 outcome of branch instructions; and 

4 said activity monitor is operable to indicate the recent 

5 utilization of said instruction-execution unit based on the recent 

6 accuracy rate of said branch predictions. 

1 17. An IC having controlled power consumption, comprising: 
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2 a functional unit, having throttling parameters, operable in 

3 a normal mode and in a reduced-power mode; 

4 an activity monitor, coupled to said functional unit, 

5 operable to indicate the recent utilization of said functional unit; 

6 and 

7 a mode controller, coupled to said functional unit and to 



said activity monitor, operable to place said functional unit in said 
reduced-power mode when said recent utilization is greater than 
a threshold, said throttling parameters comprising said threshold. 



1 18. The IC of claim 17, further comprising: 

2 a unit, coupled to said functional unit and to said mode 

3 controller, operable to reduce the rate at which said functional 

4 unit is clocked when said functional unit is in said reduced-power 

5 mode. 



1 19. The IC of claim 17 wherein said functional unit comprises a cache 

2 memory to store information also stored in an external memory, and said 

3 IC further comprises: 

4 a unit, coupled to said functional unit and to said mode 

5 controller, operable to access external memory when said 

6 functional unit is in said reduced-power mode. 

1 20. The IC of claim 17 wherein said functional unit comprises a cache 

2 memory to store information from an external memory, and said IC 

3 further comprises: 
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4 a unit, coupled to said functional unit and to said mode 

5 controller, operable to throttle prefetches from said external 

6 memory to said cache memory when said functional unit is in said 

7 reduced-power mode. 

1 21 . The IC of claim 17, wherein said functional unit is operable to 

2 perform speculative operations, and said IC further comprises: 

3 a unit, coupled to said functional unit and to said mode 

4 controller, operable to throttle said speculative operations when 

5 said functional unit is in said reduced-power mode. 

1 22. The IC of claim 17, wherein 

2 said functional unit is operable in cycles; and 

3 said activity monitor indicates said recent utilization by 

4 changing an activity level by a first amount for each of said cycles 

5 that said functional unit is active and changing said activity level 

6 by a second amount for each of said cycles that said functional 

7 unit is inactive, said throttling parameters further comprising said 

8 activity level, said first amount and said second amount. 

1 23. The IC of claim 22, wherein at least one of said throttling 

2 parameters is predetermined. 

1 24. The IC of claim 22, further comprising: 

2 a power coordinator operable to alter at least one of said 

3 throttling parameters. 
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1 25. An integrated circuit with controlled power consumption, 

2 comprising: 

3 a functional means for performing a function operable in a 

4 normal mode and in a reduced-power mode; 

5 a monitor means, coupled to said functional means, for 

6 calculating an activity level indicative of the recent utilization of 

7 said functional means; and 

8 a control means, coupled to said functional means and to 

9 said monitoring means, operable to place said functional means 

10 in said reduced-power mode when said activity level is greater 

1 1 than a predetermined threshold. 

1 26. A method of controlling power consumption within an integrated 

2 circuit (IC) f comprising: 

3 monitoring the recent utilization of a functional unit within 

4 said IC; 

5 comparing said recent utilization with a threshold; 

6 placing said functional unit in a normal mode when said 

7 recent utilization is less than said threshold; and 

8 placing said functional unit in a reduced-power mode 

9 when said recent utilization is greater than said threshold. 

1 27. The method of claim 26, wherein said reduced-power mode 

2 comprises reducing the rate at which said functional unit is clocked. 
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1 28. The method of claim 26, wherein said functional unit comprises a 

2 cache memory and said reduced-power mode comprises accessing 

3 external memory. 

1 29. The method of claim 26, wherein said functional unit performs 

2 speculative operations and said reduced-power mode comprises 

3 throttling said speculative operations. 

1 30. The method of claim 26, wherein said threshold is predetermined. 

1 31 . The method of claim 26 further comprising: 

2 dynamically determining said threshold. 

1 32. The method of claim 26, wherein said monitoring comprises 

2 changing an activity level by a first amount for each cycle in which said 

3 functional unit is active and changing said activity level by a second 

4 amount for each cycle in which said functional unit is inactive. 

1 33. The method of claim 31 , further comprising: 

2 dynamically determining said first amount. 

1 34. The method of claim 31 , further comprising: 

2 dynamically determining said second amount. 



1 

2 
3 



35. An integrated circuit (IC) having controlled power consumption, 
comprising: 

a first functional unit within said IC; 
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4 a second functional unit within said IC, said second 

5 functional unit being operable in a normal mode and in a 

6 reduced-power mode; 

7 an activity monitor within said IC, coupled to said first 

8 functional unit, operable to produce an activity level indicative of 

9 the recent utilization of said first functional unit; and 

10 a controller within said IC, coupled to said second 

1 1 functional unit and to said activity monitor, operable to place said 

12 second functional unit in said reduced-power mode when said 

1 3 activity level is greater than a threshold. 

1 36. An integrated circuit (IC) having controlled power consumption, 

2 comprising: 

3 a plurality of functional units each operable in a normal 

4 mode and in a reduced-power mode; 

5 a plurality of local power controllers, each associated with 

6 at least one of said functional units and each having throttling 

7 parameters, operable to control the power consumption of said 

8 associated functional unit in accordance with the current values 

9 of said throttling parameters; and 

10 a power coordinator, coupled to at least two of said local 

1 1 power controllers, operable to read a throttling parameter in a first 

12 one of said coupled local power controllers and, based thereon, 

1 3 operable to alter a throttling parameter in a second one of said 

14 coupled local power controllers. 
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1 37. A method of controlling the power consumption of an integrated 

2 circuit (IC), comprising: 

3 monitoring the recent utilization of a first functional unit 

4 within said IC to produce an activity level; and 

5 controlling the mode of operation of a second functional 

6 unit within said IC, said second functional unit being operable in 

7 a normal mode and in a reduced-power mode, by placing said 

8 second functional unit in said reduced-power mode when said 

9 activity level is greater than a threshold. 

1 38. A method of controlling power consumption of an integrated 

2 circuit (IC), comprising: 

3 controlling the mode of operation of a plurality of functional 

4 units within said IC based on a set of throttling parameters, each 

5 said functional units being operable in a normal mode and in a 

6 reduced-power mode; 

7 coordinating the power consumption of said plurality of 

8 functional units by monitoring and altering said throttling 

9 parameters. 

1 39. An integrated circuit (IC) with power consumption controllable by 

2 a processor, comprising: 

3 a functional unit operable in a normal mode and in a 

4 reduced-power mode; 

5 an activity monitor, coupled to said functional unit, having a 

6 throttling parameter, operable to generate, in accordance with 

7 said throttling parameter, an activity level indicative of the recent 
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8 utilization of said functional unit, and operable to alter said 

9 throttling parameter responsive to a command from said 

1 0 processor; and 

11 a mode controller, coupled to said functional unit and to 

1 2 said activity monitor, operable to control said mode of said 

1 3 functional unit responsive to said activity level. 

1 40. The IC of claim 39, wherein 

2 said command occurs responsive to platform software 

3 executing on said processor. 

1 41 . The IC of claim 39, wherein 

2- said command occurs responsive to operating system 

3 software executing on said processor. 

1 42. The IC of claim 39, wherein 

2 said command occurs responsive to applications software 

3 executing on said processor. 
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AMENDED CLAIMS 

[received by the International Bureau on 16 January 1998 (16 01 98)- 
original claims 1-42 replaced by new claims 1-32 (10 pages)] 

A microprocessor with controlled power consumption, comprising: 

a storage unit to store a dynamically alterable activity threshold; 
a floating-point unit operable to compute floating point 
arithmetic in a normal mode and in a reduced-power mode; 

an activity monitor, coupled to said floating-point unit, operable 
to monitor utilization of said floating-point unit; and 

a mode controller, coupled to said storage unit and to said 
activity monitor, operable to place said storage unit in said reduced- 
power mode when said utilization is greater than said activity 
threshold. 



1 2. The microprocessor of claim 1 , further comprising: 

2 a unit, coupled to said floating-point unit and responsive to a 

3 power-reduction signal from said mode controller, to reduce the rate 

4 at which said floating-point unit is clocked when said floating-point 

5 unit is in said reduced-power mode. 

1 3. A microprocessor with controlled power consumption, comprising: 

2 an activity threshold memory to store a dynamically alterable 

3 threshold; 

4 a cache memory operable to store information also stored in an 

5 external memory and operable in a normal mode and in a reduced- 

6 power mode; 

7 an activity monitor, coupled to said cache memory, operable to 

8 produce an activity level indicative of utilization of said cache memory; 

9 and 
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10 a mode controller, coupled to said activity threshold memory 

1 1 and to said activity monitor, operable to place said cache memory in 

1 2 said reduced-power mode when said activity level is greater than said 

1 3 activity threshold. 

1 4. The microprocessor of claim 3, further comprising: 

2 a unit, coupled to said cache memory and responsive to a 

3 power-reduction signal from said mode controller, operable to access 

4 said external memory when said cache memory is in said reduced- 

5 power mode. 

1 5. The microprocessor of claim 3, further comprising: 

2 a unit, coupled to said cache memory and responsive to a 

3 power-reduction signal from said mode controller, operable to throttle 

4 prefetches of said information into said cache memory when said 

5 cache memory is in said reduced-power mode. 

1 6. The microprocessor of claim 3 wherein said stored information 

2 comprises instructions, and said microprocessor further comprises: 

3 an instruction-execution unit, coupled to said cache memory, 

4 operable to execute said instructions. 

1 7. The microprocessor of claim 3 wherein said stored information 

2 comprises data, and said microprocessor further comprises: 

3 a data-computation unit, coupled to said cache memory, 

4 operable to perform computations on said data. 
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1 8. A microprocessor with controlled power consumption, comprising: 

2 a data-computation unit, operable to perform computations on 

3 data stored in an external memory; 

4 a data cache, coupled to said data-computation unit and to said 

5 external memory, operable to store said data; 

6 an activity monitor, coupled to said data cache, operable to 

7 indicate recent utilization of said data cache; 

8 an instruction-execution unit, operable to execute instructions 

9 from said external memory; 

1 0 an instruction cache, coupled to said instruction-execution unit 

1 1 and to said external memory, operable to store said instructions and 

1 2 operable in a normal mode and in a reduced-power mode; and 

13 a mode controller, coupled to said instruction cache and to said 

1 4 activity monitor, operable to place said instruction cache in said 

1 5 reduced-power mode when said recent utilization is greater than a 

16 threshold. 

1 9 A microprocessor with controlled power consumption, comprising: 

2 an instruction-execution unit, operable to speculatively execute 

3 instructions; 

4 an activity monitor, coupled to said instruction-execution unit, 

5 operable to indicate the recent utilization of said instruction-execution 

6 unit; and 

7 a mode controller, coupled to said instruction-execution unit 

8 and to said activity monitor, operable to throttle said speculative 

9 instruction execution, and not to throttle non-speculative instruction 
1 0 execution, when said recent utilization is greater than a threshold. 
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1 10. An IC having controlled power consumption, comprising: 

2 a functional unit operable in a normal mode and in a reduced- 

3 power mode; 

4 an activity monitor, coupled to said functional unit, operable to 

5 indicate utilization of said functional unit; and 

6 a mode controller, coupled to said functional unit and to said 

7 activity monitor, operable to place said functional unit in said reduced 

8 power mode when said utilization is greater than a threshold 

9 indicated by a throttling parameter associated with said functional 

1 0 unit, said throttling parameter being dynamically adjustable to alter 

1 1 said threshold. 

1 11. The IC of claim 1 0, further comprising: 

2 a unit, coupled to said functional unit and responsive to said 

3 mode controller, operable to reduce the rate at which said functional 

4 unit is clocked when said functional unit is in said reduced-power 

5 mode. 

1 12. The IC of claim 10 wherein said functional unit comprises a cache 

2 memory to store information also stored in an external memory, and said IC 

3 further comprises: 

4 a unit, coupled to said functional unit and responsive to said 

5 mode controller, operable to access external memory when said 

6 functional unit is in said reduced-power mode. 

1 13. The IC of claim 10 wherein said functional unit comprises a cache 

2 memory to store information from an external memory, and said IC further 

3 comprises: 
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4 a unit, coupled to said functional unit and responsive to said 

5 mode controller, operable to throttle prefetches from said external 

6 memory to said cache memory when said functional unit is in said 

7 reduced-power mode. 

1 14. The IC of claim 10, wherein said functional unit is operable to perform 

2 speculative operations, and said IC further comprises: 

3 a unit, coupled to said functional unit and responsive to said 

4 mode controller, operable to throttle said speculative operations when 

5 said functional unit is in said reduced-power mode. 

1 1 5. The IC of claim 1 0, wherein 

2 said functional unit is operable in cycles; and 

3 said activity monitor indicates said utilization by changing an 

4 activity level by a first amount for each of said cycles that said 

5 functional unit is active and changing said activity level by a second 

6 amount for each of said cycles that said functional unit is inactive, said 

7 throttling parameter comprising said activity level. 

1 16. The IC of claim 15, further comprising: 

2 a power coordinator operable dynamically to alter said 

3 throttling parameter. 

1 17. An integrated circuit with controlled power consumption, comprising: 

2 a functional means for performing a function operable in a 

3 normal mode and in a reduced-power mode; 

4 a monitor means, coupled to said functional means, for 

5 calculating an activity level indicative of utilization of said functional 

6 means; and 
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7 a control means, coupled to said functional means and to said 

8 monitoring means, operable to place said functional means in said 

9 reduced-power mode when said activity level is greater than a 

10 predetermined threshold, the predetermined threshold being 

1 1 dynamically adjusted. 



1 18. A method of controlling power consumption within an integrated 

2 circuit (IC), comprising: 

3 monitoring utilization of a functional unit within said IC; 

4 dynamically adjusting an activity threshold associated with the 

5 functional unit; 

6 comparing said utilization with said activity threshold; 

7 placing said functional unit in a normal mode when said 

8 utilization is less than said activity threshold; and 

9 placing said functional unit in a reduced-power mode when 
1 0 ~ said utilization is greater than said activity threshold. 

1 19. The method of claim 1 8, wherein said reduced-power mode 

2 comprises reducing the rate at which said functional unit is clocked. 

1 20. The method of claim 18, wherein said functional unit comprises a 

2 cache memory and said step of placing said cache memory in said reduced- 

3 power mode comprises diverting accesses to the cache memory to an 

4 external memory. 



1 21 . The method of claim 18, wherein said functional unit performs 

2 speculative operations and said step of placing said functional unit in said 

3 reduced-power mode comprises throttling execution of said speculative 

4 operations. 
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1 22. The method of claim 18, wherein said monitoring comprises changing 

2 an activity level by a first amount for each cycle in which said functional unit 

3 is active and changing said activity level by a second amount for each cycle 

4 in which said functional unit is inactive. 

1 23. The method of claim 22, further comprising: 

2 dynamically determining said first amount. 

1 24. The method of claim 22, further comprising: 

2 dynamically determining said second amount. 

1 25. An integrated circuit (IC) having controlled power consumption, 

2 comprising: 

3 a first functional unit within said IC; 

4 a second functional unit within said IC, said second functional 

5 unit being operable in a normal mode and in a reduced-power mode 

6 and having an activity level related to an activity level of the first 

7 functional unit; 

8 an activity monitor within said IC, coupled to said first functional 

9 unit, operable to produce an activity level indicative of the activity level 

10 of said first functional unit; and 

11 a controller within said IC, coupled to said second functional 

1 2 unit and to said activity monitor, operable to place said second 

13 functional unit in said reduced-power mode when said activity level of 

1 4 said first functional unit is greater than a threshold. 

1 26. An integrated circuit (IC) having controlled power consumption, 

2 comprising: 
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3 a plurality of functional units each operable in a normal mode 

4 and in a reduced-power mode; 

5 a plurality of local power controllers, each associated with at 

6 least one of said functional units and each having throttling 

7 parameters, operable to control the power consumption of said 

8 associated functional unit in accordance with the current values of 

9 said throttling parameters; and 

10 a power coordinator, coupled to at least two of said local power 

1 1 controllers, operable to read a throttling parameter in a first one of 

12 said coupled local power controllers and, based thereon, operable to 

1 3 alter a throttling parameter in a second one of said coupled local 

1 4 power controllers. 

1 27. A method of controlling the power consumption of an integrated circuit 

2 (IC), comprising: 

3 monitoring utilization of a first functional unit within said IC to 

4 produce a first activity level; and 

5 controlling the mode of operation of a second functional unit 

6 within said IC, said second functional unit having a second activity 

7 level related to said first activity level of said first functional unit, and 

8 being operable in a normal mode and in a reduced-power mode by 

9 placing said second functional unit in said reduced-power mode 
1 0 when said first activity level is greater than a threshold. 

1 28. A method of controlling power consumption of an integrated circuit 

2 (IC), comprising: 

3 controlling the mode of operation of a plurality of functional 

4 units within said IC based on a set of throttling parameters, each said 
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5 functional units being operable in a normal mode and in a reduced- 

6 power mode; and 

7 coordinating the power consumption of said plurality of 

8 functional units by dynamically monitoring and altering said throttling 

9 parameters. 

1 29. An integrated circuit (IC) with power consumption controllable by a 

2 processor, comprising: 

3 a functional unit operable in a normal mode and in a reduced- 

4 power mode; 

5 an activity monitor, coupled to said functional unit, having a 

6 throttling parameter, operable to generate an activity level indicative 

7 of utilization of said functional unit; 

8 a memory unit configured to store a throttling parameter, said 

9 throttling parameter being dynamically alterable by said processor; 

10 and 

11 a mode controller, coupled to said functional unit and to said 

1 2 activity monitor, operable to control said mode of said functional unit 

1 3 responsive to said activity level and to said throttling parameter. 

1 30. The IC of claim 29, wherein 

2 said command occurs responsive to platform software 

3 executing on said processor. 

1 31 . The IC of claim 29, wherein 

2 said command occurs responsive to operating system software 

3 executing on said processor. 

1 32. The IC of claim 29, wherein 
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2 said command occurs responsive to applications software 

3 executing on said processor. 
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